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(54) Digital image scrambling for image coding systems 

(57) Methods and apparatus for encryption and 
decryption of digital images are disclosed. A preferred 
embodiment operates on an image frame after that 
frame has undergone a_space-frequency transform 
operation, such as a block DCT or wavelet transform^ 
and before the frame is passed to a bitstream coder for 
entropy coding. The transform coefficient map is sub- 
jected to one or more encryption operations that render 
a subsequently decoded (but not decrypted) image 
incomprehensible. These operations are designed to 
operate with low computational overhead and with only 
minor effects on compressed bit rate. They also allow 
secure transcoding at intermediate routers of the trans- 
mission channels without the cryptographic key. 

In one operation, the sign bits of transform coeffi- 
cients are scrambled. In another operation, two dimen- 
sional blocks of coefficients from a common subband 
are shuffled and/or rotated to pseudorandom locations 
and orientations. In yet another operation, coefficients 
occupying a common "subband", but taken from differ- 
ent DCT blocks, are shuffled. Still another operation 
shuffles motion vectors and/or scrambles sign bits for 
motion vector coefficients. These operations perturb the 
data as it will appear visually, without greatly perturbing 
the entropy of the data as presented to an entropy 
coder. 
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Description 

FIELD OF THE INVENTION 

5 [0001 ] This invention pertains generally to digital imaging, and more particularly to digital image scrambling. 
BACKGROUND OF THE INVENTION 

[0002] Digital images, including digital video, are often communicated or distributed over non-private channels, 

10 such as satellite links, cable television networks, wireless home networks, and the Internet. Conditional access systems 
for private digital image/video transmission or storage are a necessity for many applications, for example, pay-TV, con- 
fidential videoconferences, confidential facsimile transmissions, and medical image transmission and storage in a data- 
base. Digital cryptography techniques must be used in conjunction with non-private channels if unauthorized parties are 
to be prevented from gaining access to such private imagery. 

75 [0003] Video scramblers are commonly employed to prevent unauthorized access to image data. Several video 
scrambling systems rely on methods of directly distorting the visual image data such that, without descrambling, the 
video appears unintelligible to a viewer. For example, U.S. Patent 4,100.374, issued July 11, 1978, to N. Jayantand S. 
Kak, and entitled "Uniform permutation privacy system", describes an approach where a video signal is divided into 
groups of N successive video samples, and samples within a group are then permuted. U.S. Patent 5,321 ,748, entitled 

20 "Method and apparatus for television signal scrambling using block shuffling", issued June 14, 1994, to D. Zeidler and 
J. Griffin, describes an alternate approach where blocks of video lines and lines within a block are shuffled. In U.S. Pat- 
ent 5,815,572, entitled "Video scrambling", and issued Sept. 29, 1998, to G. Hobbs, the approach includes a combina- 
tion of video permutation modes, including line reversal, line inversion, line permutation and block (of lines) permutation, 
where the combination of modes used changes as time progresses. These methods have several drawbacks, including: 

25 1) they can severely degrade the compressibility of the images; and 2) they are vulnerable to code-breaking attacks 
because of the highly spatially- and temporally-correlated nature of video sequences. 

[0004] In many systems for scrambling digital images, the images are first subject to compression, and then the 
compressed image data is treated as ordinary data and is encrypted/decrypted using traditional cryptographic algo- 
rithms such as the Digital Encryption Standard (DES). See H. Pinder and M. Palgon, "Apparatus and method for cipher 

30 stealing when encrypting MPEG transport packets," U.S. Patent 5,684,876, Nov. 4, 1997; N. Katta et. al, "Scrambled 
transmission system," U.S. Patent 5,621,799, Apr. 15, 1997. Due to the high data rate of video (even compressed 
video), these methods add a large amount of processing overhead to meet a real-time video delivery requirement. To 
reduce the amount of processing overhead, several researchers have proposed selective encryption of MPEG com- 
pressed video data. See T. Maples and G. Spanos, "Performance study of a selective encryption scheme for the secu- 

35 rity of networked, real-time video," Proc. 4 th Inter. Conf. Computer Communications and Networks, Las Vegas, Nevada 
(Sept. 1995); J. Meyer and F. Gadegast, "Security mechanisms for multimedia data with the example MPEG-1 video," 
http://www.es. tuberlin.de/phade/Dhade/secmpea.html (1995). For example, in selective encryption, only the entropy- 
coded I frames, or the entropy-coded I frames and Intra-coded blocks of predictive (P/B) frames may be encrypted. I. 
Agi and L. Gong showed in "An empirical study of secure MPEG video transmissions," The internet Society Symposium 

40 on Network and Distributed System Security (Feb. 1 996), that in some cases the encryption of I frames alone does not 
provide sufficient security. These systems use the known synchronization word used for improving the error immunity 
of digital transmission systems. For this reason, they may be vulnerable to possible attacks on plain text data or block 
marks that are often used in compression systems. To selectively encrypt some segments of the compressed data such 
as Intra blocks sometimes incurs additional header overhead to locate such segments (see, e.g. , Meyer and Gadegasfs 

45 method). In addition, this classical approach is not very secure for transcoding at intermediate routers of the transmis- 
sion channel because the transcoder must be able to decrypt. 

[0005] Other systems use more elaborate means to distort video images. B. Macq and J. Quisquater propose, in 
"Digital images multiresolution encryption", J. Interactive Multimedia Assoc. Inteil. Property Pro/., vol. 1 , no. 1 , pp. 1 79- 
1 86 (Jan. 1 994), a three-step process for scrambling an image. The image is first transformed by a "Linear Multiresolu- 

so tion Transform" (LMT) proposed by the authors. Selected rows and columns of the transformed image are then shuffled. 
The shuffled transform image is then subjected to an inverse LMT prior to transform and bitstream coding. A decoder 
reverses these steps to restore the original image. Although this method is less vulnerable to code-breaking attacks, 
and can provide a level of transparency (e.g., a degraded version of the original image is visible in the scrambled sig- 
nal), it still has disadvantages— 4he two additional transforms required at each end add complexity, and image com- 

55 pressibility is still adversely affected. 

[0006] One researcher proposes performing one or more of a group of shuffling operations on the Discrete Cosine 
Transform (DCT) coefficients of an image. L. Tang, "Methods for encrypting and decrypting MPEG video data effi- 
ciently," Proc. The Fourth ACM International Multimedia Conference (ACM Multimedia '96), pp. 219-229, scrambles 
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each of the 8x8 blocks of DCT coefficients obtained during MPEG transform coding, before the coefficients are input to 
the MPEG entropy coder. This scrambling may entail 1) shuffling the AC coefficients within each block, 2) shuffling the 
AC coefficients using two shuffle tables (with a second random variable determining which shuffle table to apply to each 
block), 3) grouping the DC coefficients from eight blocks and encrypting the group with DES, and 4) splitting the DC 

5 coefficient from each block into two DC bit patterns, placing one of these in the last AC coefficient position of the block, 
and then scrambling all coefficients for the block. Although these techniques are not complex and provide a reasonable 
level of security, they change the statistical properties (e.g., the run-length characteristics) of the DCT coefficients. As 
a result, they may increase the bit rate of the compressed video by as much as 50%. This approach is also not very 
secure for transcoding at intermediate routers because the cryptographic key is needed to decrypt before requantiza- 

w tion. 

SUMMARY OF THE INVENTION 

[0007] It is recognized herein that digital image encryption presents a set of issues, aside from security, that are 
. is unique in the data cryptography field. A digital image scrambling scheme should have a relatively simple implementa- 
tion, amenable to low-cost decoding equipment and low-delay requirement for real-time interactive applications. It 
should have a minimum adverse impact on the compressibility of the image. It should preferably be independent of the 
bitstream compression selected for the image, and allow compression transcoding without decryption. It should provide 
good overall security, although it may also be preferable in some systems to allow non-authorized users a level of trans- 
20 parency, both to entice them to pay for full transparency, and to discourage code-breaking. 

[0008] The present invention provides digital image scrambling that meets the objectives outlined above. It is appar- 
ently the first digital image scrambling approach that can meet each of these objectives without compromise. Preferably, 
the invention accomplishes these objectives by operating on transformed images, prior to Huffman, run-length, arithme- 
tic, embedded, or other entropy coding. The encryption/decryption operations performed by the invention are designed 
25 to preserve, as much as possible, the transformed image properties that allow entropy coders to efficiently compress 
an image. And the preferred encryption operations are computationally inexpensive operations, such as block shuffling 
and bit-scrambling on a subset of bits. 

[0009] In accordance with a first aspect of the invention, a method of encrypting a digital image is disclosed. The 
method includes applying a space-frequency transform to the image, thereby generating a transform coefficient map. 

30 The map is then encrypted using one or more encryption techniques selected from the following: scrambling the sign 
bits of the coefficients in the map; scrambling the refinement bits of the coefficients; partitioning the map into a set of 
two-dimensional coefficient blocks of motion vectors and a predicted frame for P or B frames collected by the slice of 
image data and shuffling selected blocks within the map; and grouping a set of transform coefficients collected by the 
slice of image data from a spatial frequency subband and shuffling the transform coefficients within the group. 

35 [0010] In a second aspect of the invention, several methods of encrypting a digital image are disclosed. In one 
method, a group of bits are selected across a block of data, the group having lower than average predicted compress- 
ibility, as compared to the predicted compressibility of the block of data as a whole. These bits are then scrambled. In 
a second method, a motion-compensation data component of a digital video steam is selectively scrambled. 
[001 1 ] In accordance with another aspect of the invention, an image encryption system is disclosed. The system 

40 comprises an encryption buffer that accepts transformed image data, along with at least one encryption subsystem 
operating on transform data stored in the buffer. The subsystem(s) can include a sign bit scrambler, a block shuffler, a 
block rotator, and a subband coefficient shuffler. The system may further comprise a quantizer and/or an entropy coder 
that operates on encrypted transform data. 

[0012] In a further aspect of the invention, an encrypted image decryption system is disclosed. The system com- 
45 prises a decryption buffer that accepts encrypted transform data, along with at least one decryption subsystem operat- 
ing on encrypted transform data stored in the buffer. The subsystem(s) can include a sign bit descrambler, a block 
deshuffler, a block derotator, and a subband coefficient deshuffler. The system may further comprise an entropy 
decoder and/or a dequantizer that operates on entropy coded encrypted transform data. 

50 BRIEF DESCRIPTION OF THE DRAWING 

[001 3] The invention may be best understood by reading the disclosure with reference to the drawing, wherein: 

Figure 1 shows a prior art MPEG video coder; 
55 Figure 2 shows data organization for an MPEG video frame; 

Figure 3 illustrates DCT and transform coefficient ordering for an MPEG video block; 
Figure 4 shows a prior art MPEG video decoder, 

Figures 5 and 6 show, respectively, simplified block diagrams for an image coder and an image decoder according 
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to the invention; 

Figure 7 shows confidence interval trends for DCT coefficient magnitude as a function of spatial frequency; 
Figure 8 illustrates sub-band ordering for DCT coefficients from the luminance component of an image slice; 
Figures 9 and 10 illustrate subband shuffling techniques according to an embodiment of the invention; 
5 Figure 1 1 depicts the subband organization for a wavelet transform coefficient map; 
Figure 12 illustrates a process for shuffling subbands of a wavelet transform; 
Figure 13 illustrates a process for scrambling bits of a group of coefficients; 

Figures 14 and 15 illustrate, respectively, a video coder and a video decoder according to embodiments of the 
invention; 

10 Figures 1 6 and 1 7 illustrate, respectively, an encrypter and a decrypter according to embodiments of the invention; 
and 

Figures 18 and 19 illustrate the performance of the invention. 
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

15 

[0014] The preferred embodiments are disclosed below as applied 1) to a DCT-based image codec, such as those 
set forth in the JPEG, MPEG-1 , MPEG-2, and H.26X standards, and 2) to a wavelet-based image codec. These embod- 
iments were selected by way of illustration and not by way of limitation. Indeed, the disclosed embodiments apply 
equally to other image codecs that exhibit the properties exploited in the present invention. 

20 [001 5] Several terms appearing in this disclosure have defined meanings. A space-frequency transform represents 
an image as a set of coefficients, each coefficient containing both spatial frequency and spatial location information. 
Block-based spatial frequency transforms and wavelet transforms are examples. A transform coefficient map contains 
space-frequency transform coefficients. Although typically stored in a two-dimensional array, the map can practically be 
stored in any desired format. The definition of a map includes sub-maps and space-frequency-time transform coeffi- 

25 cient maps. 

[0016] Shuffling refers to a process that randomizes the order of its input to produce a re-ordered output. Scram- 
bling refers to a process that randomizes its input in any manner to produce an output. A key refers to any symbol or 
device that allows a user to access an encryption/decryption sequence. 

[0017] Figure 1 shows the general architecture for an MPEG-like video coder 30. An input image stream is divided 
30 into /, R and B frames for input to the system. / (intracoded) frames are directly coded, and P (predicted) and B (bidi- 
rectional^ predicted) frames are partially indirectly coded using information from other frames. An operator may select 
the frequency of /, P, and B frames in the image sequence, with the restriction that at least some / frames must be used. 
An / frame and its dependent P and B frames are generally referred to as a group of pictures (GOP). 
[0018] DCT 32 operates on 8x8 pixel blocks 56a of an input image (see Figure 2). At the input to DCT 32, input 
35 image frame 50 is divided into horizontal slices 52a-f (the number of slices shown is chosen for illustration, and is not 
fixed in general) for processing. For the luma component of a color image, each slice is 1 6 pixels wide. The correspond- 
ing chroma components of the image are sampled at half the spatial frequency of the luma component, such that a 
chroma slice is 8 pixels wide. Each slice (see slice 52a) is further partitioned into macroblocks 54a-f (the number of 
macroblocks shown is chosen for illustration, and is not fixed in general). Each macroblock contains six blocks (e.g., 
40 blocks 56a-f), such that the first four blocks 56a-d together cover a 16x16 pixel area from the luma component of the 
current slice, and the fifth and sixth blocks 56e and 56f cover corresponding 8x8 areas taken respectively from the two 
chroma components of the slice. 

[0019] Figure 3 illustrates the operation of a DCT 32 that outputs block coefficients in zigzag order. DCT 32 per- 
forms a two-dimensional discrete cosine transform on 8x8 pixel block 56a to produce a corresponding 8x8 block of 
45 transform coefficients 60. The upper-leftmost coefficient DC represents the avenge intensity of block 56a. As one 
moves down and/or right in coefficient block 60, the spatial frequencies represented by the coefficients increase. Thus 
the zigzag order, indicated by the numbering of the coefficients in block 60, approximately orders the coefficients from 
lowest to highest spatial frequencies. 

[0020] Once the coefficients of block 60 are arranged in zigzag order, quantizer 34 of Figure 1 scales the coeffi- 
so cients (note that zigzag ordering can also be performed after quantization). The DC coefficient quantizer step size may 
be fixed. The coefficients are quantized to a scale commensurate with their range of values. 
[0021 ] Bitstream coder 36 may treat the DC coefficients differently also. Within each slice, the DC coefficients may 
be differentially-coded and transmitted using a variable-length code. The remaining 63 coefficients, together with the 
DC coefficient in some cases, are run-length encoded to take advantage of the sparse population of non-zero coeffi- 
55 cients in a typical block 60, particularly at the highest frequencies. The bitstream output of bitstream coder 36 com- 
prises a block-by-block coding as described, with headers inserted at the macroblock, slice, frame, and group of 
pictures level. 

[0022] At the video frame input to coder 30, the group of pictures sequence is used to determine whether the next 
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incoming frame will be an /, P, or B frame. / frames are input directly to DCT 32 (note that a JPEG coder processes 
single image frames in a manner similar to / frame video processing in coder 30). P and B frames are not input directly 
to DCT 30, but instead go through a prediction channel that attempts to exploit the temporal redundancies found in most 
video sequences. 

5 [0023] Motion compensator 44 attempts to match the blocks of a P or B frame with the blocks of a prediction frame 
or frames. For instance, the first P frame following an / frame is predicted from that / frame. The quantized / frame 
appearing at the output of quantizer 34 is "decoded" by an inverse quantizer 40 and an inverse DCT 42 to represent the 
/ frame as it will be seen by a decoder operating on the bitstream output of coder 30. Motion compensator 44 attempts 
to find a best fit prediction for each macroblock of the P frame, based on the quantized prediction frame. The offset from 

10 the macroblock location to the prediction location with the best fit is described by a motion vector. In some cases (such 
as where a new object is introduced to the scene) prediction can be poor, and motion compensator 44 opts not to pre- 
dict that macroblock, but to let it be intracoded like an / frame instead. 

[0024] Motion compensator 44 produces two outputs for each input P or 8 frame: a set of motion vectors and a pre- 
dicted frame. The motion vectors are supplied to bitstream coder 36 for output coding. The predicted frame is sub- 
is traded from the input P or B frame in image adder 38 to form a residual frame. The residual frame is then input to DCT 
32 in the same manner as an / frame. 

[0025] Figure 4 shows a video decoder 62 appropriate for decoding a bitstream produced by video coder 30. A bit- 
stream decoder 64 recovers the transform coefficient and motion vector information from the coded bitstream. The 
transform coefficient information is passed through inverse quantizer 40 and inverse DCT 42. The / frames are fully 
20 reconstructed at this point, and can be output as well as fed to motion compensator 66. Motion compensator 66 con- 
structs prediction frames using the motion vector information and appropriate / and P frame data. Image adder 68 com- 
bines prediction frames with residual frames to reconstruct P and B frames. 

[0026] Figures 5 and 6 show, respectively, general block diagrams for an image encrypter and coder 70 and an 
encrypted image decoder 80 according to the invention. In coder 70, an encrypter 74 is inserted between image trans- 
25 form 72 and bitstream coder 76. In decoder 80, a corresponding decrypter 84 is inserted between bitstream decoder 
82 and inverse image transform 86. 

Encryption Methods 

30 [0027] In most prior art image encryption, scrambling is performed either prior to image transformation or subse- 
quent to bitstream coding. Although one researcher (L. Tang, "Methods for encrypting and decrypting MPEG video data 
efficiently," discussed in the Background of the Invention) performs scrambling between image transformation and bit- 
stream coding, his method differs from the present invention significantly, such that most of the advantages of the 
present invention are not found in Tang's method. 

35 

Subband shuffling 

[0028] The present invention includes two general sub-methods of encryption, each based on the recognition of a 
different characteristic of transform coefficient data. The first sub-method recognizes that shuffling the arrangement of 

40 coefficients in a transform coefficient map can provide effective security without destroying compressibility, as long as 
the shuffling does not destroy the low-entropy aspects of the map relied upon by the bitstream coder. The second sub- 
method recognizes that although wholesale encryption of individual transform coefficients is generally undesirable 
(because coefficient encryption adds complexity and destroys the compressibility of the low-entropy coefficient data), 
some bits of individual transform coefficients have high entropy and can thus be encrypted without greatly affecting 

45 compressibility. 

[0029] Several examples will illustrate how the present invention shuffles transform coefficients without destroying 
compressibility. Figure 7 shows a hypothetical a priori confidence interval (bounded by lines 90 and 92) for quantized 
DCT coefficients, as a function of spatial frequency. After fixed quantization, higher frequency terms are much more 
likely to fall below the half-LSB cutoff line 93 than are low frequency terms— consequently, there is a much higher like- 
so lihood that such terms will be represented as a zero by the coder. 

[0030] Most MPEG-type bitstream coders rely on the statistics of an average coefficient block to provide efficient 
coding. Note that after zigzag ordering, the coefficients are arranged approximately in increasing frequency. The bit- 
stream coder uses a variable-length codeword run-length coding technique that generally assigns shorter codewords 
to combinations of coefficient values and run lengths that are more likely, based on the concepts illustrated in Figure 7. 
55 Thus, the shorter codewords tend to favor runs followed by small coefficients. 

[0031 ] In the coefficient shuffling method proposed by Tang, the zigzag coefficient ordering is destroyed. This gen- 
erally shortens avenge run-lengths and places some large coefficients in unlikely places in the coding order. As a result, 
the run-length coder will not operate efficiently. With Tang's method, up to 50% increases in bit rate are observed, 
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mainly due to this effect. 

[0032] The present invention includes a coefficient shuffling method that provides effective scrambling without 
destroying the statistics relied upon by a run-length coder. In one embodiment illustrated in Figure 8, a slice 94 of a DCT 
coefficient map is input to the coefficient shuffler. The blocks are re-arranged in zigzag order across rows, and with the 
blocks stacked down columns as shown in map 96. 

[0033] With the DCT coefficients arranged as shown in map 96, it can be appreciated that each column represents 
the same spatial frequency, as measured at different 8x8 spatial locations in the original image slice. Although the coef- 
ficients in a given column of map 96 are not expected to have identical values, they should have a similar a priori sta- 
tistical distribution. Thus the coefficients in the column can in many cases be re-shuffled without significantly degrading 
the statistics relied upon by a run-length coder. 

[0034] In one embodiment, map 96 is divided into "subbands" of coefficients with similar spatial frequency magni- 
tude. Although subbands can be as small as a single column, one convenient subband division (shown in Figure 8) 
groups coefficients along one or more diagonals of the original coefficient block (corresponding to one or more "zigs" 
and "zags") together. 

[0035] The coefficients in each subband are shuffled within that subband. Shuffling tables will generally be different 
for different subbands and for the same subbands of different slices. Figure 9 shows an example of subband shuffling 
for a particular subband 100 containing coefficients A through X. The subband is passed to a subband coefficient shuf- 
fler 98, along with a key. The key is used to create a shuffling map (alternately, the shuffling map can be supplied directly 
to shuffler 98). Shuffler 98 uses the shuffling map to produce a shuffled subband. In a simplified embodiment, subband 
coefficients taken from the same block remain together after shuffling, producing a shuffled subband such as subband 
102. This allows shuffling map size to be independent of subband width. In a more complex embodiment, coefficients 
are shuffled without limitation, producing a shuffled subband such as subband 104. 

[0036] Figure 10 shows an even simpler subband shuffling approach. The shuffler is essentially reduced to a sub- 
band rotator 106 that uses a small set of possible shuffle outputs, with the key being used to select the shuffle table. For 
example, the possible shuffle results may be limited to one of four values with a two-bit key— e.g., half -shifting the coef- 
ficients downwards (output subband 116), flipping the coefficients vertically (output subband 114), flipping them hori- 
zontally (output subband 112), or flipping them in both directions (output subband 1 10). Generally, a small number of 
shuffle permutations will still render an unintelligible inverse-transformed image (without deciphering), although the per- 
mutations that must be attempted by a code breaker are reduced. 

[0037] This same shuffling concept can be equally applied to other types of image coders, for example, a wavelet 
transform coder. A wavelet transform coder separates an image into subbands representing different spatial frequen- 
cies, with each subband retaining the spatial arrangement of the original image (but at a different resolution). Figure 1 1 
shows ten subbands (LH1-3, HL1-3, HH1-3, and LL3) that represent a three-level wavelet decomposition of an input 
frame obtained by separable wavelet filtering along the rows and columns of an input frame. 
[0038] Like in the DCT-based transform discussed above, the statistics of the coefficient distribution generally differ 
from subband to subband. Also, because the coefficients of the subbands are arranged in the spatial arrangement of 
the original image, neighboring coefficient correlation exists that can be exploited by a bitstream coder. The goal of the 
present invention is to provide a coefficient shuffling method that does not destroy these statistical properties. 
[0039] In one embodiment, each subband is considered separately for shuffling. Shuffling tables will generally be 
different for different subbands. Each subband is divided into a number of blocks of the same size, for example the six- 
teen blocks A-P shown for subband LH1 in Figure 12. The blocked subband is then input, along with a shuffling key or 
shuffling map, to a block shuffler 122. Block shuffler 122 outputs a shuffled subband 124. 

[0040] Since the scrambling performed by block shuffler 1 22 is block-based, it retains most of the local 2-D statistics 
of the subband signal. Therefore, the negative impact on subsequent statistical coding is minimized, while the visual 
effect of the shuffling on a decoded encrypted image is dramatic. In general, block size can be selected to trade security 
for statistical coding impact, with larger and fewer blocks producing less security but less impact on statistical coding. 
[0041] To further improve security with little impact on statistical coding, shuffled subband 124 can be input, along 
with a shuffling key or shuffling map. to a block rotator 126. Block rotator 126 selects one of eight possible orientations 
(0, 90, 180, and 270 degree rotations for each of the original block and a transposed block) for each block and 
rotates/transposes the block to that orientation, producing rotated and shuffled subband 128. 

Bit Scrambling 

[0042] Several examples will illustrate the second invention sub-method, which scrambles selected bits in the trans- 
form coefficients to encrypt an image. Figure 1 3 shows a table 1 32 of an arbitrary group of eight coefficients values wO- 
w7, each having 7 magnitude bits b0-b6, with b6 being the most significant bit and bO being the least significant bit, and 
a sign bit s. Directly encrypting each coefficient in the table is costly, both in terms of computing power needed to 
decrypt the coefficients, and in terms of compressibility, since encryption randomizes the coefficient values. 
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[0043] Each bit of a coefficient can be viewed as one of three types. Significance bits for a coefficient are the most 
significant bit with a value of 1 , and any preceding bits with a value of 0. These bits limit the magnitude of the coefficient 
to a known range. Refinement bits are the remaining magnitude bits, used to refine the coefficient within the known 
range. The sign bit determines whether the known range is positive or negative. 

5 [0044] It is recognized herein that the efficiency of a bitstream coder differs depending on the bit type being coded. 
Most transforms create a large number of coefficients having small magnitude, meaning that a significance bit is much 
more likely to have a value of 0 than a value of 1 . Zigzag ordering and wavelet transforms also tend to group small mag- 
nitude coefficients together. Thus the significance bits have relatively low entropy, and are therefore highly compressi- 
ble. On the other hand, most transforms produce coefficients with sign bits that have an approximately equal probability 

10 of being a 1 or a 0, and that are highly uncorrected with the sign bits of neighboring coefficients. Refinement bits also 
tend to have approximately equal probabilities of 1 or 0, and are highly uncorrected with neighboring refinement bits. 
Because of their high entropy (and limited predictability), the sign bits and refinement bits are not highly compressible. 
[0045] In one embodiment, the present invention selects individual non-significance bits from each coefficient and 
scrambles these. Because these bits have limited predictability to star with, scrambling them results in a negligible 

is decrease in bitstream coding efficiency. In Figure 1 3, the coefficients from table 1 32 are supplied to a sign bit scrambler 
130, along with a cryptographic key. The key is used to scramble the sign bits (e.g., by exclusive-ORing the sign bits 
with a pseudorandom bitstream), producing a table 134 of distorted coefficients w0-w7. Roughly half of the coefficients 
in table 134 will have the wrong sign, although a code breaker will not know which ones. Because the sign-inverted 
coefficients distribute their energy over the entire block of pixels they were derived from, sign bit scrambling is quite 

20 effective at producing severe degradation in image quality. 

[0046] In a transform of an image having all positive pixel values, the sign of a low-pass or "DC" coefficient is always 
positive unless the image average is removed from the term. Simply scrambling the sign bit on such a coefficient may 
be an ineffective form of security, since the DC coefficient locations are either known or can be easily guessed at. In this 
case, the "sign" of the term can be toggled by inverting the coefficient magnitude about a predefined value, such as the 

25 half-maximum value for the coefficient. Alternately, if the DC-coefficients are to be differentially coded, the sign bits can 
be scrambled after differential coding. 

[0047] In another embodiment, the refinement bits of the coefficients can be scrambled. This does not provide the 
same level of degradation as sign bit scrambling, because the significance bits and sign bit define the magnitude range, 
after which the refinement bits only add at most plus or minus 33% to the coefficient value. Nevertheless, scrambling 

30 refinement bits adds an additional level of image degadation and security at low added complexity. A refinement bit 
scrambler can be implemented like sign bit scrambler 130. The only difference is that refinement bits do not occupy a 
specific column in table 132. A refinement bit scrambler may thus choose to scramble only the most significant, or the 
two most significant, refinement bits from each coefficient. This latter option would correspond to scrambling the follow- 
ing bits in the specific case of table 132: bits b4 and b5 of coefficient wO; bits b2 and b3 of w1 ; bits b4 and b3 of w2; bits 

35 b0 and b1 of w3 and w4; no bits for w5 and w6; and bit bO of w7. 

[0048] Other forms of selective bit scrambling according to the invention can be devised to work with specific known 
bitstream coders. For example, MPEG 1 transmits DCT coefficients with a known variable-length code based on run 
length and coefficient value. For a given run-length, many coefficient values may produce a variable-length code of the 
same length. Any such coefficient value can be permuted to any other such coefficient value without increasing the 

40 MPEG 1 bitstream coder's bit rate. The previous embodiments enable encryption of space-frequency transforms for still 
images, intra-coded video frames, and residual video frames related to temporal prediction. A further embodiment 
greatly improves the encryption for predicted video, with little penalty in processing power or bandwidth. In this embod- 
iment, motion vector information is scrambled, e.g., using one of the methods described above. 

45 Motion Vector Scrambling 

[0049] Motion compensation creates an array of motion vectors, for example, one vector per macroblock of a frame 
to be coded. These vectors reference a position in a reference frame (e.g., the immediately preceding / frame) having 
the best fit to the macroblock to be coded. A decoder constructs a predicted frame by offsetting into the same reference 
so frame using the motion vectors, extracting pixels from that reference frame at the positions indicated by the motion vec- 
tors, and combining these pixels in a new frame. Thus the predicted frame (and the output frame) can be distorted by 
changing the sign bits of motion vectors (if the motion vectors are to be differentially coded, the sign bits can be scram- 
bled after differential coding), shuffling the motion vectors within the motion vector array, or otherwise distorting the 
motion vectors. 
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Figure 14 shows a video coder 140 according to an embodiment of the invention. Coder 140 is compatible 
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with MPEG video coding, and contains many of the functions found in video coder 30 of Figure 1. But in coder 140, the 
output of quantizer 34 and the motion vector output of motion compensator 44 are fed to encrypter 142 for encryption 
by one or more of the methods disclosed above. After encryption, the encrypted DCT transform coefficients and motion 
vectors are sent to a bitstream coder 144. 

[0051] Although an encrypter can exist as a hardwired sequence of functions, a configurable encrypter 142 can be 
implemented as shown in the block diagram of Figure 16. A data routertouffer 160 accepts transform coefficients, 
motion vectors (if applicable), and one or more cryptographic keys or shuffle tables, and caches these during encryp- 
tion. According to the encryption configuration selected, router/buffer 160 makes data available in an appropriate 
sequence to one or more of the functions connected to router/buffer 160. For example, upon receiving each transform 
coefficient block, the block may first be sent to a sign bit handler 162 and bit scrambler 170 for sign bit scrambling. When 
all blocks of a slice are received and sign bit scrambled, the slice may be directed to subband blocking 168, and then 
one or more of the subband blocks can be sent to coefficient shuffler 1 72. After bit scrambling and coefficient shuffling, 
the buffered slice is output to bitstream coder 144. 

[0052] Decoder 150, and its associated decrypter 154 (Figures 15 and 17), essentially reverse the process to 
recover the transform coefficients and motion vectors as originally supplied to encrypter 142. Decrypter 154 has a data 
router/buffer 180 that performs similar functions as data router/buffer 160. Bit descrambler 190, coefficient deshuffler 
192, block deshuffler 194, and block derotator 196 invert the processes of their corresponding blocks in Figure 16. 
[0053] A prior art decoder, such as decoder 62 of Figure 4, can receive a bitstream produced by video coder 140 
and comprehend it as an MPEG bitstream. But the decoded video signal will appear scrambled. Likewise, a decoder 
150 according to the invention, but without access to the appropriate cryptographic key, can comprehend such a bit- 
stream as an MPEG bitstream but will be unable to descramble the video. A level of transparency can be provided to 
users of prior art decoders and decrypting decoders without an appropriate key, by choosing not to encrypt low-fre- 
quency subband information. These users will be able to view a noisy, low-detail version of the video. Likewise, a clear 
picture may require different keys for different subbands, such that users may have the ability to receive degraded video 
with one key, and clear video if they possess all keys. 

[0054] Another feature of the disclosed embodiments is that an output bitstream can be transcoded without knowl- 
edge of the key. For example, an encrypted output bitstream can be passed through an appropriate bitstream decoder, 
and then through a new bitstream encoder. Alternately, in a coder such as an embedded coder, the output bitstream for 
a frame can be truncated at any point without affecting the ability of a decrypter according to the invention to decrypt 
whatever portion of the bitstream remains. 

[0055] The security of the scrambling process can be analyzed as follows.. For the encryption of the sign bits, if a 
code-breaker is to completely recover a single original, frame, an exhaustive search of 2 M trials is required, where M is 
the number of non-zero coefficients in the frame. For a 512x512 frame, assuming, conservatively, that only 256 non- 
zero coefficients exist, the number of required trials is about 10 75 . If an attacker uses a smoothness constraint in the 
spatial domain to search for the best estimate of the original sign bits, each trial includes an inverse transformation (at 
least a local inverse transformation). Of course, since the encryption of the sign may not render a completely indiscern- 
ible image, an attacker may not make such an effort to recover a perfect image. 

[0056] The next step, block shuffling, will render a completely incomprehensible image, as will be shown in the 
experimental result section. Theoretically, it is very difficult to recover the image frame without knowing the shuffling 
table. Consider a subband that contains 64 blocks. These 64 blocks are shuffled to one of 64! possible permutations. 
Of course, there may be many blocks that contain only zero coefficients, especially for high frequency subbands. 
Assuming there are n zero blocks and all other blocks are different from each other, then the number of different per- 
mutations is 64!/nl. If n=48, then the number of different permutations is about 10 28 , with each permutation requiring 
inverse transforms for all blocks affected by the subband permutation. Given multiple subbands per group of transform 
blocks, multiple groups per frame, and multiple frames per second, it quickly becomes infeasible to perform any appre- 
ciable amount of code breaking on a block shuffled transform image. It should be noted that with wavelet transform 
data, the attacker potentially may try to search for the best estimate directly in the transformed domain by exploiting 
some structure of the coefficient image such as edge continuity. This attack is, however, difficult to construct due to the 
uncorrelated nature of the coefficient image, particularly when there is no prior knowledge about the content of the 
video. Human interaction may be necessary to assist the recovery. That, however, consumes a lot more time for each 
trial, compared to automated recovery by computer. 

[0057] Block rotation further increases the difficulty of recovering an original frame without the key. In this case, 
assuming eight possible ways of rotation, there are 512 (64x8) potential candidate blocks to fill 64 locations. Again, 
assuming there are n zero blocks in the decompressed subband and all other blocks are different from each other, then 
the number of different configurations is 512l/(8n)l, which is significantly larger than 64l/nl. 

[0058] Each disclosed method can be employed individually or in combination, in any preferred order. The shuf- 
fling/rotation tables may not be the same for different video frames. For more secure video transmission, a single key 
can be used to generate a set of different shuffling/rotation tables for scrambling consecutive video frames. More 
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dynamic shuffling/rotation tables make the system more secure, with the tradeoff being a slightly increased complexity. 
The key can also be updated as time progresses to provide a dynamic key-based scrambling system. Known methods 
for key generation, transmission, and usage can be employed in the system. 

[0059] Shuffled blocks can be either contiguous, spatially distributed, or even randomly located throughout a frame. 
5 Contiguous blocks may be preferable, as this tends to reduce the memory requirements of the decoder and latency of 
the system. 

[0060] In general, the scrambling of I frames will render the following P/B frames difficult to perceive due to the 
dependency of P/B frames on I frames. This may lead to the conclusion that P/B frames need not be scrambled. 
Although it may not be necessary to scramble all P and B frames, it is preferable that at least intra-coded blocks of those 
w frames be scrambled, and more preferable that motion vector information be scrambled also. 

Experimental results 

Wavelet-Based System 

15 

[0061 ] The experimental results are reported in tabular and in image format for a set of specific examples. Although 
the images in the Figures 18 and 19 illustrate the performance of the invention, they are not required for one to gain a 
complete understanding of the invention. 

[0062] In the first set of experiments, a five-level wavelet decomposition is performed on an input image frame. The 

20 sign bits of the wavelet coefficients are first encrypted using a sequence of independent identical distributed (i.i.d.) 
pseudorandom bits with equal probability of 1 and — 1 , generated from a given key. The pseudorandom bits are exclu- 
sive-ORed with the original sign bits, and the resulting bits are used as the scrambled signs bits of the coefficients. 
Given the key and the scrambled signs, the original signs can be perfectly recovered by another exclusive-OR with the 
same sequence of pseudorandom bits. Image 1(a) of the Fig, 18 shows an original image, while Image 1(b) shows the 

25 same image after sign encryption and decoding without decryption. Image 1(b) is significantly distorted, but the main 
structure of the image content is still discernible. This encrypted image provides some level of transparency. 
[0063] For more security, blocks of wavelet coefficients are shuffled. For each subband, the coefficients are divided 
into 64 blocks of equal size. For example, if the image size is 512x512, then the highest level subbands will have a size 
of 256x256, and the lowest level subbands will have a size of 16x16. We divided each subband into 64 blocks, yielding 

30 2x2 blocks for the lowest subband and 32x32 blocks for the highest subband. 

[0064] There are many ways to generate the shuffling tables. In these experiments, the following procedure was 

used. The locations of the blocks were numbered 1, 2 64. A [0,1] uniformly distributed pseudorandom number is 

generated using the key as the seed. The interval [0,1] is divided into 64 subintervals 1 -64 of equal length. Suppose the 
random number falls into subinterval y, then the first block will be mapped to the f* location. Then the interval [0,1] is 

35 divided into 63 subintervals of equal length, and a second random number is generated. Depending on which subinter- 
val the random number locates in, the second block will be mapped to one of the remaining 63 locations. This process 
continues until all blocks are mapped. For different subbands, different shuffling tables are generated. If block rotation 
is also employed, the subintervals can each be further subdivided to determine each block's rotation. 
[0065] Image 1(c) shows the image of 1(a) after the transform coefficients have been block shuffled and inverse 

40 transformed. The features of the original frame are virtually unrecognizable. Image 1 (d) shows block rotation alone, and 
image 1(e) shows a combination of sign encryption and block shuffling. Finally, image 1(f) shows the results after a 
combination of sign encryption, block shuffling, and block rotation. Note that although the scrambled images in 1(c), 
1 (e), and 1 (f) are almost equally incomprehensible, the security levels are different. 

[0066] For comparison purposes, image 1(g) shows a version of 1(a) after application of a simple scheme where 
45 lines of wavelet coefficients are shuffled within each subband. The original image has some vertical structure, which the 
line shuffling scheme does not render incomprehensible. 

[0067] The impact of each of these scrambling approaches on the compression efficiency is shown in Table 1 . The 
compression schemes used are state-of-the-art compression schemes— rate-distortion optimized embedded coding 
(RDE) and layer zero coding (LZC). It can be seen in Table 1 that sign encryption alone introduces no loss of the peak 
so signal-to-noise ratio (PSNR). Block shuffling or block rotation introduce only 0.2-0.4 dB loss from the original PSNR (or 
equivalent^, up to a 5% bit rate increase). Similar amounts of PSNR loss are observed for the combination of these 
three strategies. On the other hand, the line scrambling scheme introduces up to 1.1 dB loss of the PSNR, or equiva- 
lent I y P a 22% increase in bit rate. 
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Scrambling Method 


PSNR (dB) at 0.25bpp 


RDE 


LLC 


Lena 
(512x512) 


Barbara 
(512x512) 


Lena 
(512x512) 


Barbara 
(512x512) 


No scrambling 


32.62 


28.67 


32.47 


28.36 


Sign encryption 


32.64 


28.66 


32.46 


28J6 


Block shuffling 


32.24 


28.34 


32.17 


28.19 


Block rotation 


32J5 


28.32 


32.26 


28.24 


Line shuffling 


31.90 


27.54 


31.79 


27.46 


Sign+Block shuffling 


3203 


28.39 


32.17 


2820 


Sign+block shuffling* block 
rotation 


32.27 


28.24 


32.16 


28.18 



Table 1 : Impact of different scrambling techniques on compression efficiency for the 
6 wavelet transform based system. 

25 



8x8 Block-DCT-Based System 

30 

[0068] The proposed scrambling methods are integrated into the H.263 verification model coder maintained by the 
University of British Columbia. In these experiments, the test videos are QCIF size (176x144). For subband shuffling, 
these experiments treat a row of macroblocks as a slice. Coefficients and motion vectors are shuffled within a slice. In 
other words, for each subband (frequency location), 44 (11x4) coefficients from this band of luminance blocks will be 
35 shuffled, and 1 1 coefficients from this band of each chrominance component will be shuffled. Note that we can also 
group the 22 coefficients from a particular band of the two chrominance components together and shuffle them, 
although no results are reported in this section for such a test. The selection of a slice as a unit for shuffling aims to 
restrict the memory requirement for scrambling. 

[0069] To reduce the number of shuffling tables, AC coefficients from some bands are grouped together and shut- 

40 fled using the same shuffling table. In particular, DC coefficients use one shuffling table. The first two AC bands/coeffi- 
cients in the zigzag order share another shuffling table. Then the next three AC bands in the zigzag order share a 
shuffling table; then the next four AC bands share a shuffling table, and so on. In the experimental results reported in 
the following, only the first 45 bands in the zigzag order were shuffled. The other bands were left intact. 
[0070] A first test tested / frame scrambling. Image 2(a) shows an original / frame from the "earphone" sequence. 

45 Image 2(b) shows a corresponding frame after sign bit encryption for the coefficient values and inverse transformation. 
Although the image is greatly distorted, much of the image is still comprehensible (possibly due to the large contribution 
of the DC coefficients that retained their correct sign). It is seen that the shuffling along a slice method with/without sign 
encryption (images 2(c) and 2(d), respectively) renders a completely incomprehensible frame. Also shown in image 
2(e) for comparison purposes is the result obtained with the method of Tang where coefficients are shuffled within an 

so 8x8 block. For this particular sequence, with Tang's method the person in the scene remains somewhat discernable due 
to the uniform darkness of his shirt (shuffling coefficients within blocks will not change the darkness). 
[0071] Table 2 shows the impact of the scrambling approaches on the compression efficiency for / frames. As 
expected, sign encryption has no impact on the compression efficiency. Shuffling along slices with/without sign encryp- 
tion increases the size of the compressed I frame by about 10%. Shuffling within blocks, on the other hand, increases 

55 the size of the frame by more than 1 00%. 
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Seramhlinff Methrvi 


OIZC 

(bits) 


POMP 

(dB) 


No scrambling 


17280 


32.24 


Sign encryption 


17280 


32.24 


Shuffle along slice 


18920 


32.24 


Sign+shuffle along slice 


18920 


32.24 


ShufQe within block [Tang] 


36008 


31.78 



Tabic 2: Impact of different scrambling techniques on compression efficiency for one I 
frame of "earphone" sequence. 



[0072] Table 3 shows the impact of the scrambling approaches on the compressibility of the sequence. Again, the 
sign encryption has no impact on the compression efficiency. The shuffling along slices method with/without sign 
encryption, on the average, increases the bit rate of the compressed sequence by about 20%. This suggests that the 
impact of the shuffling along slices method on compression efficiency is more severe on P frames than on / frames. If 
both shuffling along slices and sign encryption are used for / frames (and intracoded blocks), but only sign encryption 
is used for P frames, then the bit rate of the compressed sequence only increases by 1 .6%. By way of comparison, 
Tang's shuffling within blocks method increases the bit rate by about 50%. 



Scrambling Method 


Bit rate 
(kbit/s) 


PSNR(dB)(P 
frames) 


No scrambling 


27.97 


31.90 


Sign encryption 


27.94 


31.91 


Shuffle along slices 


33.51 


31.90 


Sign+shuffle along slices 


33.70 


31.91 


I(sign+siice)+P(sign) 


28.42 


31.90 


Sign+Slice+MV_sign 


34.59 


31.91 


ICsign-Hsu'ceJ+Pfsign+MV^sign) 


29.33 


31.90 


Shuffle within block [Tang] 


43.40 


31.90 



Table 3: Impact of different scrambling techniques on compression efficiency for 41 
frames (one I frame followed by 40 P frames) of "earphone" sequence. 



[0073] In our experiments, we found that for all scrambling schemes tested, if motion vector information was not 
encrypted, then we could perceive that someone was talking in the scene, although the detail was not visible. We 
believe encryption of motion information may be important for some applications. It is also a very effective way to scram- 
ble P/B frames because the reconstructed P/B frames depend heavily on the accuracy of the motion vectors. 
[0074] Table 3 shows that encrypting the signs of all coefficients and the signs of all motion vectors and only shuf- 
fling along slices for / frames/blocks (l(sign+slice)+P(sign+MV_sign)) provides a very good compromise between secu- 
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rity and coding efficiency. This method only increases the bit rate by 4.6%, and with the encryption of motion vector 
signs incorporated, the video sequence is completely indiscernible. Other combinations of the above mentioned scram- 
bling methods are also possible. For example, the method of shuffling motion vectors within a slice can be combined 
with other coefficient encryption schemes. 

[0075] The encryption system presented in this disclosure can be used as one component of a complete video 
transmission or storage system. It is, in principle, independent from other components such as compression and trans- 
mission. In some circumstances, performance can be improved by integrating the encrypter with another block of a 
coder. For example, a context-predictive coder can make use of a shuffling table to determine the context and the coef- 
ficient coding order based on the "real" location of blocks, thereby reducing the coding inefficiencies introduced by the 
edge effects produced by block shuffling. The tradeoff in such a system is flexibility (e.g., transcodability). 
[0076] One of ordinary skill in the art will recognize that the concepts taught herein can be extended in many other 
obvious and advantageous ways. Such minor modifications are encompassed within the invention, and are intended to 
fall within the scope of the claims. 

Claims 

1 . A method for encrypting a digital image, comprising the steps of: 

applying a space-frequency transform to an original digital image, thereby generating a transform coefficient 
map corresponding to the digital image; and 

encrypting the transform coefficient map, using one or more encryption techniques selected from the group of 
techniques consisting of 

scrambling the sign bits of the transform coefficients, 
scrambling the refinement bits of the transform coefficients, 

partitioning the transform coefficient map into two-dimensional coefficient blocks, and shuffling selected 
blocks within the coefficient map, and 

grouping a set of transform coefficients from a spatial frequency subband, and shuffling the transform 
coefficients within the group. 

2. The method of claim 1 , further comprising the step of entropy coding the encrypted transform coefficient map. 

3. The method of claim 1 , wherein one or more spatial frequency subbands of the original transform coefficient map 
bypass the encrypting step. 

4. A method for encrypting a block of digital image data, the method comprising the steps of: 

selecting a group of bits across the block of image data, the group having lower than average predicted com- 
pressibility as compared to the predicted compressibility of the block of image data as a whole; and 
scrambling the group of bits. 

5. The method of claim 4, wherein the block of digital data comprises a space-frequency transform coeff ident map. 

6. The method of claim 5, wherein the group of bits comprises the sign bits from the transform coefficient map. 

7. The method of claim 5, wherein the group of bits comprises refinement bits from the transform coefficient map. 

8. The method of claim 4, wherein the block of digital data comprises motion-compensation data. 

9. A method for encrypting a digital video stream comprising a motion-compensation data component, said method 
comprising the step of selectively scrambling the motion-compensation data component of the digital video stream 
prior to bitstream coding of the digital video stream. 

10. The method of claim 9, wherein the step of selectively scrambling the motion-compensation data component com- 
prises scrambling the sign bits of the motion-compensation data values. 

1 1 . The method of claim 9, wherein the step of selectively scrambling the motion-compensation data component com- 
prises selecting a group of motion vectors and shuffling the vectors within the group. 
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12. An image encryption system (142) comprising 
an encryption buffer (160), and 

at least one encryption subsystem operating on transform data stored in the encryption buffer (160), the 
5 encryption subsystem selected from the group of subsystems consisting of: 

a sign bit scrambler (1 70) that accepts a space-frequency transform coefficient map having signed coefficients, 
and scrambles the sign bits of the coefficients; 

a block shuffler (1 74) that accepts a space-frequency transform coefficient map and shuffles two-dimensional 
coefficient blocks to pseudorandom locations in the map; 
10 a block rotator (176) that accepts a space-frequency transform coefficient map and rotates two-dimensional 

coefficient blocks to pseudorandom orientations; and 

a coefficient shuffler (1 72) that shuffles transform coefficients from a spatial frequency subband to pseudoran- 
dom locations. 

15 1 3. The encryption system of claim 1 2, further comprising an entropy coder that operates on encrypted transform data 
supplied by the encryption subsystem. 

14. The image encryption system of claim 12, wherein the sign bit scrambler (170) operates on a motion vector having 
signed components. 

20 

1 5. The image encryption system of claim 1 2, wherein the coefficient shuffler (1 72) operates on a group of motion vec- 
tor components. 

16. An encrypted-image decryption system comprising 
a decryption buffer (180), and 

at least one decryption subsystem operating on encrypted transform data stored in the decryption buffer (180), 
the decryption subsystem selected from the group of subsystems consisting of: 

a sign bit descrambler (190) that accepts a space-frequency transform coefficient map and/or a motion vector 
array having encrypted sign bits, and applies a decryption key to the encrypted sign bits to recreate the original 
sign bits; 

a block deshuffler (194) that accepts a space-frequency transform coefficient map having shuffled two-dimen- 
sional coefficient blocks, and applies a decryption key to the coefficient block pattern to restore the blocks to 
their original locations; 

a block derotator (196) that accepts a space-frequency transform coefficient map having rotated two-dimen- 
sional coefficient blocks, and applies a decryption key to the coefficient block pattern to restore the blocks to 
their original orientations; and 

a coefficient deshuffler (1 92) that applies a decryption key to the coefficient pattern having shuffled coefficients 
from a spatial frequency subband to restore the coefficients in the group to their original locations. 

17. The encrypted-image decryption system of claim 16, further comprising an entropy decoder that operates on an 
input bitstream and supplies encrypted transform data to the decryption buffer (180). 

18. The encrypted-image decryption system of claim 16, wherein the sign bit descrambler (190) operates on a motion 
45 vector array having encrypted sign bits. 

19. The encrypted-image decryption system of claim 16, wherein the coefficient deshuffler (192) operates on a motion 
vector array having shuffled motion vector data. 
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